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ABSTRACT 

We show that it is equivalent, for certain sets of finite graphs, to be definable in CMS 
(counting monadic second-order logic, a natural extension of monadic second-order 
logic), and to be recognizable in an algebraic framework induced by the notion of 
modular decomposition of a finite graph. 

More precisely, we consider the set Too of composition operations on graphs 
which occur in the modular decomposition of finite graphs. If T is a subset of Too, 
we say that a graph is an jF-graph if it can be decomposed using only operations 
in T . A set of .^-graphs is recognizable if it is a union of classes in a finite-index 
equivalence relation which is preserved by the operations in T. We show that if T is 
finite and its elements enjoy only a limited amount of commutativity — a property 
which we call weak rigidity, then recognizability is equivalent to CAf ^-definability. 
This requirement is weak enough to be satisfied whenever all ^-"-graphs are posets, 
that is, transitive dags. In particular, our result generalizes Kuske's recent result on 
series-parallel poset languages. 

Keywords: graph languages, logical definability, algebraic recognizability 

The connection between recognizability and definability is one of the cornerstones 
of theoretical computer science, going back to Biichi's celebrated theorem on finite 
and infinite words in the 1960s (see Pfj]). This theorem states the equivalence 
between two fundamental properties of a language: 

• to be definable in monadic second order logic (MS), 

• to be recognizable. 

In Biichi's work, recognizability is defined by means of a finite state automaton. It is 
well-known that recognizability by such an automaton is equivalent to algebraic rec- 
ognizability, that is, to being the union of classes in a finite-index congruence. (This 
is well-known for languages of finite words but it also holds, with the appropriate 
definitions, for languages of infinite words \22\ 123) 1. 

One direction in which this result has been extended, is the consideration of sets 
of other combinatorial structures than words. For instance, M S'-definability and 
recognizability are equivalent for trace languages (see [Hj). Traces can be viewed as 
finite posets, labeled by letters in a given alphabet A — and as such as generaliza- 
tions of words, which are >l-labeled linearly ordered finite sets. 

A further generalization is the consideration of finite graphs. Graph languages 
have been widely studied for the description of complex structures or behaviors; 
among them, languages of partially ordered sets (posets) are used for modeling 
certain aspects of concurrency. 

1 Partial support from the following sources is gratefully acknowledged: Project 2102-1 of IFC- 
PAR/CEFIPRA; and AS Automates, modeles distribues et temporises of Departement STIC, 
CNRS. 
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To handle graph languages, several definitions of algebraic recognizability can be 
adopted, depending on which operations on graphs (or constructors) are put for- 
ward (see Section |2J). In this paper, we consider the operations on graphs given by 
the theory of the modular decomposition of graphs, which we call the modular sig- 
nature (see Section |2~T|) . Courcelle [5] already considered this algebraic framework, 
and proved that CM S'-definability implies recognizability (Theorem 13. II below: see 
Section O about CMS vs. MS). Lodaya, Weil [THI EZj and Kuske [H] also con- 
sidered a restriction of this algebraic framework: to a particular class of graphs, 
the series-parallel posets, and to the modular operations which suffice to generate 
them, namely the sequential and the parallel product. In particular, Kuske proved 
[T4"] the equivalence of CMS'-definability and recognizability in that case. Our main 
result establishes the same equivalence in a wider setting: it holds for a language 
of finite ^-labeled graphs L provided the graphs in L can be generated from one- 
vertex graphs using a finite number of operations in the modular signature; and 
provided that, apart from the parallel product (or disjoint union) of graphs, these 
operations enjoy only a limited amount of commutativity — a property which we 
call weak rigidity (Section 14. 1(1 . As it turns out, this requirement is weak enough 
and it is satisfied by all the finite subsets of the modular signature which generate 
only posets or even dags. In other words, our result applies to languages of finite 
A-labeled posets under a natural finite generation hypothesis, thus generalizing 
Kuske's result. 

The proof, which generalizes ideas from proofs of Kuske an d Courcelle 0, 
relies on the consideration of certain tree-like normal forms for graphs (relative to 
the modular signature) and uses crucially the notion of an MS-definable transduc- 
tion 0. 

Other operations have been considered, to make the set of finite graphs into a 
(multi-sorted) algebra. Among the most important such signatures, we mention 
the VR and the HR signatures, see 0. In a series of papers {e.g. [3 El ED 03 E] ) > 
Courcelle and co-authors have studied the connection between definability and rec- 
ognizability with respect to these signatures. A remarkable result in that direction 
is the equivalence between CMS'-definability (see Section fOl about CMS vs. MS) 
and algebraic recognizability with respect to HR, for languages of graphs of tree- 
width bounded by an integer k (Courcelle for k = 2, Kaller 13 for k = 3 and 
Lapoire |15| for the general case). This result is, however, incomparable with ours. 

1. Terminology and notation 

In this section, we fix the notation and definitions which we will use, concerning 
graphs or posets and the logical apparatus to specify their properties. 

1.1. Graphs and posets 

In this paper, all graphs are assumed to be finite. 

We consider directed A-(vertex-)labeled graphs of the form G = (V, E, A) where 
V is the (finite) set of vertices, E C V x V is the edge relation and the labeling 
function A: V — > A is a mapping into a fixed alphabet A (a finite, non-empty set). 

When the labeling is irrelevant, we omit A in the description of G. Undirected 
graphs are considered as a special case of directed graphs, where the edge relation 
is symmetric: (x, y) € E if and only if (y, x) G E. 

We always assume that our graphs do not have self-loops (edges of the form 
(x,x)), that is, E is an anti-reflexive relation. If necessary, the presence of a self- 
loop at vertex x can be encoded in the letter labeling x. 

At times, we view graphs up to isomorphism, and at other times, we insist on 
so-called concrete graphs. More precisely, when we consider graph languages, the 
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graphs in question are up to isomorphism. When we use graphs as syntactic tools 
to define algebraic operations (as in Section 12.21) , then the name of vertices is im- 
portant, that is, distinct graphs may well be isomorphic. 

A dag, or directed acyclic graph is a directed graph G = (V, E) in which no path 
is a loop. The graph G is said to be transitive if the edge relation E is transitive. In 
particular, G — (V, E) is a transitive dag if and only if E is a partial order relation 
on V (minus the refiexivity part of the relation, that is, the pairs (x,x), x G V, 
since we are considering graphs without self- loops). When we talk of posets, we 
always refer to the associated transitive dags, so that a poset language is a special 
kind of graph language. 

1.2. Logics 

In this paper, we need to discuss logical properties of various kinds of relational 
structures, beyond labeled graphs and posets as defined above. For this purpose, 
we use the classical notions as in, say, 

In general, let S be a finite relational signature, that is, a finite set equipped with 
a mapping a : S — > N into the non-negative integers, called the arity function. An 
S-structure is a set X (the domain set of the structure) equipped, for each s £5, 
with a relation s x of arity a(s). 

When >S is fixed, logical formulas can be built using the usual connectives and 
quantifiers, and the elements of S as predicates (respecting the arity function a). 

Example 1 When we discuss A-labeled graphs in this paper, the signature consists 
of the binary relation E (edge relation) and, for each letter a £ A, of a unary relation 
A a . The graph is then, in effect, viewed as a structure with domain set its vertex 
set V. 

When quantification is allowed only on elements of the domain, we talk of first- 
order or _FO-formulas. If we quantify also on sets of elements (unary relations on 
the domain), we talk of monadic second order, or M S- formulas. 

We also make intensive use of the following extension of monadic second order 
logic. The formalism of A/5-formulas is enriched with special quantifiers of the form 
3 mod q x, where q > 2 is an integer and x is a first-order variable. A formula of the 
form 3 mod q x ip(x) is interpreted to mean that the cardinality of the set of values 
of x such that tp(x) holds is mod q. The resulting logic is called CMS (counting 
monadic second order logic) 2 . 

It is well-known that CMS is strictly more expressive than MS: no MS formula 
can express the fact that an S-structure has even cardinality [2]. On the other hand, 
CMS is designed precisely to express this type of property. 

2. Recognizability: the algebraic framework 

The notion of recognizability was established in the 1960s by Mezei and Wright 
20 . It makes sense with respect to a given algebraic framework, that is, in a given 
algebra, for a given signature. 

More precisely, let J 7 be a signature (finite or infinite), that is, a set equipped 
with a mapping a:f->N into the non-negative integers, called the arity function. 
An J- -algebra is a set X equipped, for each / 6 T , with an a(/)-ary operation / . 
Morphisms of ^-algebras are defined in the usual way, see pQ. A subset L of an 
^-"-algebra X is said to be J 7 -recognizable (recognizable if there is no ambiguity) if 
there exists a morphism of J-'-algebras (p from X into a finite J-"-algebra such that 
L = tp~ l ((p(L)). Thus, the notion of a recognizable subset of X depends on the 
algebraic structure considered on X. 
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There are several natural ways to view the set of all (finite) graphs as an algebra, 
and hence several different notions of recognizability (see for instance Courcelle 
in 0J|Sj). In this paper, we operate in the algebraic framework provided by the 
existence and the uniqueness of the so-called modular decomposition of finite graphs. 
The relevant definitions are given in the next sections. 

2.1. Composition of graphs: the modular signature 

If n > 1 is an integer, we denote by [n] the set {1, . . . , n}. 

With each n-vertex graph H, we associate an n-ary operation on graphs. In order 
to properly define our algebraic setting, operations must have a linearly ordered set 
of arguments, and hence we need to view H as a concrete graph with vertex set [n\. 
In particular, distinct isomorphic graph structures on [n] define different operations. 

Let H = ([n],F) and let Gi, . . . , G„ be graphs. The graph H{Gt, . . . , G„) is 
obtained by taking the disjoint union of the graphs Gi, . . . , G„, and by adding, for 
each edge (i, j) £ F, an edge from every vertex of Gi to every vertex of Gj. In other 
words, if G l = (Vi, Ei) for i = 1, . . . ,n, then H(G\, . . . , G n ) — (V, E) where 

V = Vi U • • • U V n 

E = Ei U • • • U E n U jj Vi x Vj 

The following 2-vertex graphs provide particularly important examples of such 
operations. 

1 2 1 2 1 , 2 

H® H, 

The binary operation defined by Hq, written G\ © G2, is simply the disjoint union 
of Gi and G2; it is sometimes called the parallel product of graphs. 

The binary operation defined by H, , written G\ • G2, is called the seguential 
product, and it consists of adding to Gi © G2 every edge from a vertex of Gi to a 
vertex of G2 . 

The binary operation defined by H®, written G\ ©G2, is called the cligue product, 
and it consists of adding to Gi © G2 every edge from a vertex of Gi to a vertex of 
G2 and every edge from a vertex of G2 to a vertex of G\ . 

It is immediately seen that these three operations are associative, and that the 
operations © and £g) are commutative. 

We also note the following compositionality property: if the graph H itself 
can be written as a composition, say, H = K(L\, . . . , L r ), then the composition 
H(Gx,...,G n ) = K(L[, . . . ,L' r ) and L'- = Lj(G ijA ,. . . , G ijT .) where i^i, . . . , i j>rj 
are the vertices of H in Lj. 

If H cannot be written as a composition, we say that H is prime: the composi- 
tionality property above implies that every composition operation can be expressed 
in terms of operations defined by prime graphs. 

Finally we note the following commutation properties: if H = ([n], F) and 
H' = ([n],F') are isomorphic graphs, then the corresponding composition opera- 
tions differ only by the order of the arguments. More precisely, if a is a permutation 
of [n] which induces an isomorphism from H' into H , then 

H(G 1 ,...,G n )=H'(G a(1) ,...,G a(n) ) (CP) 

Example 2 Let H and H' be the following concrete (prime) graphs: 
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1 3 5 5 3 1 




2 4 2 4 



The permutation (1 5) defines an isomorphism from H' to H and we have 
H(G\, G2, G3, G4, G5) = H (G5, G2, G3, G4, Gi). 

Similarly, the permutation (1 5)(2 4) defines an automorphism of H and we have 
H(G\, G2, G3, G4, G5) = -ff (G5, G4, G3, G2, Gi). 

In particular, we may restrict the set of concrete prime graphs defining compo- 
sition operations to having at most one representative of every isomorphism class: 
in the above example, every iJ'-product can be expressed as an _ff-product. How- 
ever, it remains necessary to retain a concrete presentation of H , in order to have 
a unequivocal linear order on the arguments of the corresponding operation. Note 
that this restriction does not eliminate the commutation properties (CP): each au- 
tomorphism of a prime graph induces one. 

In the sequel, we select a set Too (the modular signature) consisting of the binary 
operations ©, (g), •, and of the composition operations defined by a collection of 
graphs containing exactly one representative of each isomorphism class of prime 
graphs with at least three vertices. We will now view the class of finite graphs as 
an Too- algebra. 

It is important to observe that is infinite, since there are infinitely many 
isomorphism classes of finite prime graphs. In fact, almost all finite graphs are 
prime: more precisely, their relative frequency among n- vertex graphs tends to 1, 
see |5Tj . 

2.2. Modular decomposition 

The idea of the modular decomposition of a graph has been rediscovered a number 
of times in the context of graph theory and of other fields using graph-theoretic 
representations. We refer to |2J for a historical survey of this question, and to [T§] 
for a concise presentation. In this paper, we use the following definitions. 

Let G = (V, E) be a graph. A module in G is a subset X of V which interacts 
uniformly with its complement V \ X: more precisely, if v G V \ X and E contains 
a pair (v,x) with x G X, then {v} x X C E; and dually, if (x,v) G E for some 
x G X, the X x {v} C E. 

We say that a module X is prime if X ^ V and for every module Y, either 
Y C X or X C Y 01 X DY = One can show that the prime modules of a 
prime module X of G are prime modules of G. In addition, if V is finite, then the 
maximal prime modules of G form a partition of V. Let = be the corresponding 
equivalence relation on V and let H be the quotient graph H = Gj '=: its vertex set 
is V/ = and its edge relation is the image of E in the projection from V x V onto 
(V/ =) x (V/ =). Then one can show that H is either a prime graph H = ([n],F) 
with n > 3, or it is the transitive closure of one of the three following graphs (for 
n > 2): 

1 2 •- • • • n (n-element linear order) 

1 2 • • • n (n-element set) 



1 2 ••• n (n-element clique) 
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In particular, if the maximal prime modules of G are G±, . . . , G„, then exactly one 
of the following holds: 

G = H(G\, . . . , G n ) for some prime graph H with n > 3 vertices 
G = Gi • G 2 • • • • • G n 

g = d e g 2 e ■ • ■ © g„ 

G = Gi (g) G 2 (8) • • • ® G„ 

It follows that each finite graph can be constructed from singleton graphs, using 
operations from the modular signature. Such a description of a graph is called its 
modular decomposition. In other words, the class of all finite graphs is an Too- 
algebra generated by a single element. 

Moreover, the modular decomposition of a finite graph is unique up to the as- 
sociativity of •, ©, ®, the commutativity of ©, (g>, and the commutation properties 
(CP), based on the non-trivial automorphisms of prime graphs. Note that the 
modular decomposition of a graph can be computed in linear time |18l 1191 17). 

The above discussion has been entirely concerned with unlabeled graphs. The 
generators of the .Foo-algebra of A-labeled graphs are simply the A-labeled one- 
vertex graphs: in other words, the .Foo-algebra of A-labeled graphs is generated by 
A. 

Finally, if T C Too, we say that a (labeled) graph is an J 7 -graph if it is in the 
JF-algebra generated by the singleton graphs. 

Remark 1 If T contains only dags, then the JF-graphs are dags. If in addition, 
T consists only of transitive dags (that is, posets), then the ^"-graphs are posets. 
Conversely, every prime graph occurring in the modular decomposition of a dag 
(resp. a poset) is a dag (resp. a poset). 

Similarly, if T contains only undirected graphs (graphs with a symmetric edge 
relation), then the ^-graphs are all undirected. Conversely, every prime graph 
occurring in the modular decomposition of an undirected graph is undirected. 

2.3. Tree-like representations 

We will use the following tree-like representations of an A-labeled graph to account 
for its modular decomposition. 

We first consider the tree mdec(G) (Courcelle [S) Sec. 6]), whose set of nodes is 
the set of prime modules of G, and such that a node x is the parent of a node y if 
and only if y is a maximal prime module of x. Moreover, each leaf x of mdec(G) 
(necessarily a single vertex) is labeled by X(x) £ A, and each inner node of mdec(G) 
is labeled H (a prime graph in Too with at least three vertices), •, © or ®, according 
to the fact that x is an _ff-product, a ^-product, a ©-product, or a ©-product of its 
maximal prime modules. 

In particular, each ^-labeled node has at least 2 children, none of which is •- 
labeled; the analogous property holds for each ©-labeled node and for each ©-labeled 
node. Each if-labeled node has n children if H has n vertices. 

In addition to this tree structure, mdec(G) also encodes the following informa- 
tion. First, there is a linear order on the children of a "-labeled node x, which comes 
from the modular decomposition of x. There is no such order on the children of ffi- 
or ©-labeled nodes. The case of an i7-labeled node x (where H is an n-vertex prime 
graph in Too , n > 3) is intermediary: the modular decomposition of x provides an 
enumeration (that is, a linear order) of the n children of x, which is defined up to 
the action of Aut(-ff); more formally, the modular decomposition of x provides a 
collection of linear orders on the children of x, such that any of these order relations 
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can be mapped to any other one by some permutation a 6 Aut(H); that is, these 
linear orders form an orbit under the natural action of Aut(-ff). 

In view of the discussion in Section 12.21 this enriched tree structure uniquely 
defines G. 

Technically, we view mdec(G) as a relational structure whose domain is the set 
of prime modules of G, together with the following (interpreted) predicates: 

child(cc, y) if y is a maximal prime module of x, 

label a (x) (a G A) if x is an a-labeled vertex of G, 

label^ (x) if x is an Q)-product of its maximal prime modules, 

label(gi(a;) if x is an ^-product of its maximal prime modules, 

label, (a;) if x is a ^-product of its maximal prime modules, 

x < y if there exists a prime module z, with maximal prime modules 

z\, . . . , z n , such that z — Z\ • • • • • z n , x — Zi and y = Zj for some 

1 < i < j i < n. 

label// (x) (with H G Too a graph with n > 3 vertices,) if x is an 
H-product of its maximal prime modules, 

children// (x, yi, . . . , y n ) if H G Too has n > 3 vertices and x — 
H(yi, . . .,y n ). 

Note that if label//(x), a G Aut(Zf) and yx,...,y n are the children of H, then 
children// (x,y%, . . . ,y„) if and only if children// (x, y a {i), ■ ■ - ,2/o-(n))- 

As in Courcelle [5] and Kuske ^1], we also use the following representation, 
written mdec'(G), built from mdec(G) by adding internal nodes in such a way that 
every »-labeled node has exactly two children, the first of which is not •-labeled. 
More precisely, for each ^-labeled node u of mdec(G) with children Vx < ... < v n 
(n > 3), we add •-labeled nodes Ma, ... , u n -\ in such a way that the children of u 
are v\ < U2, the children of Ui are Vi < for 2 < i < n — 2, and the children of 
u n -\ are < v n . All other nodes (that is, all nodes that are not •-labeled) and 
relations are left unchanged. 

The nodes of mdec'(G) can also be identified with subsets of the vertex set of 
G, but not necessarily with prime modules of G. More precisely, with the above 
notation, the new vertex Ui can be identified with the union {J" l=i Vh- 

As a relational structure, mdec'(G) has domain the set of its nodes, and it is 
equipped with the following predicates, inherited from mdec(G): 

child(x, y), label a (a;) (a G A), label®(x), label^x), label.(x), 
label// (x) and children// (x, y%, . . . , y n ) if H G is a graph with n > 3 
vertices. 

Instead of the relation x < y between distinct children of a ^-labeled node, 
mdec'(G) is equipped with the binary predicate 

first-child(a;, y) if label.(a;), child(x, y) and y is the first (minimal, 
left-most) child of x. 

We say that a labeled tree of the form mdec'(G), for some A-labeled graph G, 
is an mdec' -tree. 

3. Recognizability vs. definability: known results 

The connection between definability and J r oc -recognizability was first studied by 
Courcelle A slight modification of Theorem 6.11] shows the following 

Theorem 3.1 Let J- be a finite subset of Too md let L be a language of A-labeled 
J- -graphs. Then the following are equivalent: 
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• L is Too -recognizable; 

• L is J 7 -recognizable; 

• the tree language mdec'(L) is CMS-definable in the class of mdec 1 -trees. 
Moreover, if L is CMS-definable (in the class of graphs), then L is J- -recognizable. 

Remark 2 Courcelle's result jS] Theorem 6.11] is actually more precise: it also 
proves the equivalence between .F-recognizability and definability in a certain ex- 
tension of CMS-logic, called MSu n , which is well-adapted to this situation but 
lacks the good algorithmic properties of MS and CMS logic. For our purpose, 
we do not need to get into the definition of MSu n , and it suffices to know that 
CMS-definability implies M 5/i„-definability. 

Another difference between the above statement and Courcelle's result is that 
the latter is given for unlabeled graphs and for particular values of T: namely the 
(finite) subset J- n of all graphs in with at most n vertices for some n. It is 
a routine verification that the same proof applies to A-labeled graphs and to any 
finite subset of ^"oo _ which is necessarily contained in some T n - 

Courcelle shows the equivalence between MS-definability, CMS'-definability and 
^-"-recognizability when the tree language mdec'(L) satisfies certain combinatorial 
properties [SJ Theorem 6.12], and especially when the out-degree of the internal 
nodes of the elements of mdec'(L) is uniformly bounded. 

This equivalence is also known to hold without restriction on the shape of the 
trees in mdec'(L) for certain small values of T . 

If T = {•} Since H, is a poset, the .F-algebra consists of posets, and it is easily 
seen that these posets are of the form [n], equipped with the usual linear order. The 
A-generated ^"-algebra is then naturally identified with the free semigroup A + , i.e., 
the set of all finite words on alphabet A and the setting of classical language the- 
ory. Theorems [O] (together with |S| Theorem 6.12], see above) reduces to Biichi's 
theorem on the equivalence between recognizability and MS- (and hence CMS-) 
definability. 

If T = {©} The A-generated .F-algebra consists of the finite A-labeled discrete 
graphs (graphs without any edges) . This algebra is naturally identified with A® , the 
free commutative semigroup on A. It is known (Courcelle |2j) that, for languages of 
discrete graphs, recognizability is equivalent to CM S'-definability, and not to MS- 
definability. In fact, MS-definability allows only the description of finite or cofinite 
discrete graph languages 

If T = {®} The A-generated ^-algebra consists of the finite A-labeled cliques. As 
this is the dual situation of discrete graphs (by edge-complementation), the same 
results hold. 

If T = {©, •} The graphs H, and if© are posets, and hence every ^-graph is a 
poset. These posets, called the series-parallel posets, are exactly those whose graph 
is iV-free Q31I27]. 

The languages of A-labeled ^-graphs, also called series-parallel languages or 
sp-languages, were studied by Lodaya and Weil |161 IT7] and by Kuske |14|. In 
particular, Kuske showed that for sp-languages, ^"-recognizability is equivalent to 
CMS'-definability [H Theorem 6.15]. 
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4. Weakly rigid signatures and CM 5-definability 

Our main result, Theorem l4.1l below. generalizes the results of the previous section: 
it asserts the equivalence between .F-recognizability and CM S'-definability for more 
general finite subsignatures J- of the modular signature, and in particular for every 
finite subsignature consisting only of dags. 

4-1. Weakly rigid signatures 

Let H = ([n],F) be a prime graph (n > 2). We say that H is weakly rigid if the 
automorphism group Aut(H) does not act transitively on [n\. That is: there are 
vertices i ^ j of H such that no automorphism of H maps i to j. 

Example 3 For each n > 2, the directed cycle of length n, C n , is not weakly rigid. 
Indeed, every cyclic permutation of [n] defines an automorphism of C n . The same 
holds for D n , the undirected cycle of length n (for n — 2 or n > 5: D3 and D4 are 
not prime. . . ). Note that D2 = H®. 

The graph H, is weakly rigid. The graph H from Example [5] is weakly rigid 
since it has a single non-trivial automorphism, namely (1 5) (2 4). In particular, no 
automorphism of H can map vertex 1 to vertex 2. 

This graph H is a particular case of a more general situation: every prime dag 
is weakly rigid, except for Hq. Indeed, in such a dag there are maximal elements 
(for the partial order relation obtained by taking the reflexive transitive closure of 
the edge relation) and not every vertex is maximal. The weak rigidity follows from 
the simple observation that every automorphism of a dag preserves the maximal 
elements. 

More generally, every prime graph in which the in-degree or the out-degree is not 
uniform, is weakly rigid. 

We say that a subset T of T<x, is a weakly rigid signature if T is finite, T contains 
at most one of the operations ® and ©, and every other operation in T is associated 
with a weakly rigid prime graph. 

Example 4 In view of Remark ^ an d Example every finite subsignature of T x 
consisting only of dags is weakly rigid. 

We now state our main theorem. 

Theorem 4.1 Let T he a weakly rigid signature and let L be a language of A-labeled 
J- '-graphs. Then L is CMS-definable if and only if L is J 7 -recognizable. 

Remark 3 Theorem 14.11 generalizes Kuske's result on sp-languages ^1], see Sec- 
tional It constitutes a refinement of Courcelle's theorem Theorem 6.11] (see Re- 
mark El, which only asserts the equivalence between ^"-recognizability and MSu n - 
definability. However, Courcelle's result does not assume that T is weakly rigid. 

In view of the importance of poset languages, it is worth stating the following 
particular case (see Example 0} of Theorem 14.11 

Corollary 4.2 Let T be a finite subset of J-oo such that every J- -graph is a poset 
(resp. a dag). A language of A-labeled J--posets is T -recognizable if and only if it 
is C M S -definable. 
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4-2. Proof of Theorem \4.1\ 

The proof of Theorem 14.11 given below, uses the notion of an MS-definable trans- 
duction introduced by Courcelle [2 Section 2] . The definition of these transductions 
is given in Section 14.31 together with the proof of the following theorem. 

Theorem 4.3 Let J- be a weakly rigid signature. The mapping which assigns to 
each A-labeled T -graph G the tree mdec'(G) is MS -definable. 

Note that Courcelle shows that if T is any finite subset of T^, , then the mapping 
which assigns to each linearly ordered A-labeled T- graph G the tree mdec'(G) is 
an MS-transduction [5| Corollary 6.9]. With our extra assumption on J 7 , we are 
able to dispense with the heavy requirement of considering only linearly ordered 
graphs. 

Proof of Theorem 14.11 Theorem 13.11 proves half of the equivalence; namely, it 
asserts that every CM S-definable language of A-labeled jF-graphs is ^"-recognizable 
(without assuming that T is weakly rigid). 

In order to prove the converse, we assume that L is an ^"-recognizable language of 
yl-labeled J-"-graphs. By Theorem l3.ll mdec'(L) is GMS-definable in the language 
of mdec'-trees. 

Now, the inverse image of a CM S'-definable subset by an MS'-transduction is 
CM S-definable 3, Corollary 2.7]. Thus Theorem 14. II is an immediate consequence 
of Theorem ED □ 



4.3. Proof of Theorem\4~3] 

We now fix a finite weakly rigid signature T. For convenience, we assume that 
(8> T\ the proof would be completely similar if we assumed that © T . 

Let us first explain how we use the hypothesis that T is weakly rigid: for each 
(concrete) prime graph H — ([n], F) £ J- with n > 3 vertices, we fix a proper, non- 
empty subset dist(-ff) of [n] of so-called distinguished vertices, which is preserved 
under the action of Aut(-ff). Such a set exists by assumption: we can choose an 
orbit of [n] under the action of Aut(if), or in the case of a dag the set of maximal 
vertices, etc. With this choice of dist(i7), we define in an mdec'-tree a new binary 
predicate dist-child^(a;, y), interpreted to mean that x is ii-labeled, there exist 
2/1, . . . , y n such that children#(x, yi, . . . , y n ), and y = yi for some i G dist(if). 

Thus, if G is an ^-graph, each node of mdec'(G) that is neither a leaf nor is 
labeled © has some distinguished children and some non-distinguished ones. By 
convention, the distinguished vertex of iJ, is the origin of the single edge — so that 
the distinguished child of a "-labeled node is its first child. 

Remark 4 We have seen that the order of children of an i/-labeled node is defined 
only up to the action of Aut(iJ): the notion of distinguished children is devised 
precisely to take into account this flexibility. Weakly rigid operations are precisely 
those for which some children can be designated unambiguously as distinguished. 

Remark 5 Note that, the set dist(i?) being fixed, dist-child^ is not truly a new 
predicate to be added in the signature of mdec'-trees, but rather an abbreviation 
for a first-order formula in the language of mdec'-trees. 

Since we are dealing only with finite signatures, we do not bother with a formal 
mechanism to choose the sets dist(i7). If we had to work with an infinite signature, 
it would be important to introduce a more formal definition of distinguished chil- 
dren, for instance based on a logical formula (on H ) describing these distinguished 
children. For instance, in the case of dags, one could always consider the maximal 
elements (or equivalently, the vertices of in-degree zero) . 
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Now we need to show that if G is an yl-labeled .F-graph, then mdec'(G) can be 
represented in G, its domain can be specified in the monadic second-order logic of 
graphs, and the predicates of the language of mdec'-trees can be specified in the 
same language. 

More precisely, following the definition in Section 2], we need to verify the 
following (complex) condition: There exist integers k, n > and M5-formulas in 
the language of graphs f(X), tpi(x, X), . . . , ipk(x, X), where X — {X\, . . . , X n ) is a 
vector of second-order variables called parameters; and, for each ^-ary predicate q 
in the language of mdec'-trees and each length I vector ?of integers in [0, k] there 
exists an M S- formula q x(xi, . . . , xg, X) with the following property. 

First we define, for each A-labeled JF-graph G and for each assignment 7 of values 
to the vector of variables X such that G satisfies 1/5(7), the structure repr^(G) by 
letting: 

• the domain of repr^(G) consists of the pairs (v, i) such that v E V, < i < k 
and G satisfies ipi(v,j); 

• if ? = (ii, . . . is a vector of integers in [0, k], q is an £-&ry predicate and 
(vi,ii), . . . , (vf ,i e ) are elements of the domain of repr^(G), then repr^(G) 
satisfies q((vi,ii), . . . , (ve,ie)) if and only if G satisfies q j{vi, . . . ,vg,j). 

The condition to be verified is finally that for each G, there exists an assignment 7 
such that repr^(G) is isomorphic to mdec'(G). 

For this purpose, we first encode the inner nodes of an mdec'-trec in its leaves. 
This idea was first introduced by Potthoff and Thomas [21], and used also in 
Section 5]. In fact, we cannot use a single encoding as in the works cited, and 
we construct a collection of four such encodings as in Kuske's |T3]. As it turns 
out, it is more convenient to define the inverse of these encodings: this is done in 
Section IOH 

This construction allows us to consider a structure isomorphic to mdec'(G), 
and defined within G in the form required by the definition of MS'-transductions 
(Section I4.3.2fl . It then suffices to verify that the domain of repr(G) and the 
predicates in this structure are expressible by means of M S-formulas on the graph 
G, which is done in Section 14.3.31 

We strongly rely on the fact that the nodes of the tree mdec'(G) are naturally 
viewed as subsets of V (see Section ET3*)l . and that V is both the vertex set of G and 
the set of leaves of mdec'(G). In particular, in the encodings we construct, each 
inner node is represented by a leaf of mdec'(G) and not by a pair of leaves as in 
Courcelle [H] or Kuske |14| . 

4-3.1. Encoding the nodes of an mdec'-iree 

Let T be an mdec'-tree. We partition its set N of nodes as follows: we let iVo 
be the set of leaves; N± be the set of ©-labeled nodes all of whose children are 
leaves; N2 be the set of ©-labeled nodes not in Ni; and N3 be the complement of 
A^o U N± U N2. That is, N3 consists of the •-labeled and the -ff-labeled nodes, where 
H G T has arity at least 3; in particular, the nodes in 7V3 have distinguished and 
non-distinguished children. 

Next we define mappings v (resp. fiQ, ^3) from N (resp. Nq, Ni, N 2 , N 3 ) 

to the powerset of Nq as follows. 

If x G No, we let v(x) = (j,q(x) = {x}. 

If x € Ni, we let v(x) = ni(x) be the set of children of x. 

IfxE N2, we let v{x) = (J v(y) where the union runs over the children 
y of x; and we let 112 (x) = (J /U3 (y) where the union runs over the children 
of x which are not leaves, and hence which are in N3. 
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If x G N 3 , we let v(x) = [J u{y) where the union runs over the non- 
distinguished children y of x; and we let fJ, 3 {x) = [J u(y) where the union 
runs over the distinguished children y of x. 

It is easily verified that these mappings are well-defined and that, for each x G N{ 
(i = 0, 1, 2, 3), v{x) and jii{x) are non-empty sets of leaves. 

For each leaf x of T we denote by p(x) the unique path from x to the root of T. 
The following lemma is a simple rewriting of the definition of v and the in . 

Lemma 4.4 Let i G {0, 1, 2, 3} ; let y G Ni be a node ofT and let x G N be a leaf. 
The following are equivalent: 

• ifi = 0, then x G v(y) iff x G y (y) iff x = y; 

• ifi = l, then x G v(y) iff ' x G fJ,i(y) iffy is the parent node of x, y is labeled 
© and all its children are leaves; 

• if i = 2 or i = 3, then x G v{y) iff y sits along p(x) and every node in 
N 3 between x and y along p(x) is reached from one of its non- distinguished 
children; 

• if i — 2, then x G H2(y) iff y sits along p(x), y is reached from one of its 
children in N 3 , say z, z is reached from one of its distinguished children, and 
every node in N% along p{x) and before z is reached from one of its non- 
distinguished children; 

• if i = 3, then x G Ps(y) iff y sits along p(x), y is reached from one of its 
distinguished children, and every other node in N 3 along p(x) and before y is 
reached from one of its non- distinguished children; 

• ifi — 2, then x G ^2(2/) iff % G Pz{z) for some child z of y in N3. 

It follows from this lemma that for each i, is the inverse image of a partial 
onto mapping Ki : N — > N{. More precisely, we have: 

• Ko(x) = x; 

• Ki(x) is the parent node of x — if that node is labeled © and all its children 
are leaves; 

• k 3 (x) is the first node y G N 3 along p(x) (starting from the leaf x) reached 
from one of its distinguished children — if there is such a node y; 

• K2(x) is the parent node of k 3 (x) — if k 3 (x) exists and its parent node is 
labeled ©. 

4-3.2. Representing an mdec'-tree in its leaves 

Let T be an mdec'-tree as above. Let repr (T) be the following structure, with 
the same signature as mdec'-trees. The domain of repr (T) is the set 

{(x,i) I x G A^ , < i < 3, Ki(x) is defined}. 

We let: 

• label a ((x, i)) if and only if i = and label a (a;) in T (for each a G A); 

• label® ((x, i)) if i = 1 or i = 2; 

• label. ((x, i)) if and only if i = 3 and label. (k 3 (x)) in T; 

• labels ((x, i)) if and only if i = 3 and labels (k 3 (x)) in T (where H is a prime 
graph in T with at least 3 vertices) . 

• child((x, i), (y,j)) if child(fi;j(x), Kj(y)) in T; 

• first-child((x, i), (y, j)) if first-child(fi;j(x), Kj(y)) in T; 
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• children ff ((x,i), {yi,ji), (y r Jr)) if children// (^(x), n n {yi), ■ K jr (y r )); 

• dist-childjj((a;, i), (y,j)) if dist-child/r(/tj(a;), Kj(y)) in T. 

Note that the mappings Ki are usually many-to-one, so that repr (T) is not 
isomorphic to T (and it is not an mdec'-tree). 

Let us say that two elements (x,i) and (y,j) of the domain of repr (T) arc =- 
equivalent if i = j and Ki(x) = Ki{y). It is easily verified that ii X , ■ ■ ■ , X 3 C N 
are such that X — {J i=0 (Xi x {i}) is a set of representatives of the =-classcs, then 
the restriction of repr (T) to X is isomorphic to T. This substructure of repr (T) 
(which depends on the choice of the Xi, but is unique up to isomorphism), is denoted 
— abusing notation — by repr(T). 

4.3.3. mdec'(G) is MS-definable 

We now consider the case where the tree T arises from the modular decomposition 
of an A-labeled .F-graph G = (V,E,X), T = mdec'(G). 

Recall that the set No of leaves of T is equal to V and that, more generally, the 
nodes of T are particular subsets of V. In view of their definition, the mappings Ki 
can be described as follows. 

Lemma 4.5 Let x e V. 

• K (x) = {x}. 

• If the least disconnected prime module P containing x is discrete, then m(x) = 
P; otherwise K\(x) is not defined. 

• If there exists a connected node P containing x such that x lies in a distin- 
guished child of P and, for every non-trivial connected node Q containing x 
and properly contained in P, x lies in a non- distinguished child of Q, then 
K3(x) = P; otherwise Ks(x) is not defined. 

• If Ks(x) is defined and the least node P properly containing it is disconnected, 
then n 2 {x) = P; otherwise n 2 {x) is not defined. 

We use the following collection of MS'-definable properties of an ^4-labeled T- 
graph G = (V, E, A). Upper-case letters X, Y, . . . represent subsets of V or second- 
order variables, and lower-case letters x, y, . . . represent elements of V or first-order 
variables. H = ([n], F) is a prime graph in T with n > 3. 
singleton(X, x) if X = {x}. 

label a (X), where a £ A, if X is an A-labeled node of mdec'(G), that is, 
X = {x} and \ a {x) for some x. 

partition(X, Xi, . . . , X n ) if (Xi, . . . , X n ) is a partition of X, that is, X is the 
disjoint union of the Xi and each Xi is non-empty. (To be completely correct, 
this predicate should be replaced by (n + l)-ary predicates partition^, for 
n = 2 and for each n such that an n-vertex graph lies in T\ furthermore, 
everyone of these predicates can be expressed in terms of partition 2 .) 

module(X) if X is a module of G. This is equivalent to 

Vy^X 3x e X E(x, y) \fx € X E(x, y) 
A 3x £ X E(y,x) =>\/x€X E(y,x). 

pmodule(X) if X is a prime module of G, that is, 

module(X) A (VY module(T) ^ {Y C X) V (X C Y) \< (X DY = 0)). 

disconnected(X) if X is disconnected, that is, 



3Y 3Z partition^, Y, Z) A (Vy eY\/zeZ ^E{y, z) A ->E(z, y)). 
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connected(X) if X is connected, that is -idisconnected(X). 

labels (X) if X is an (B-labeled node of mdec'(G), that is, a disconnected 
prime module. 

children?/ (X, X\, . . . , X„) if X,Xi,...,X„ are prime modules and X — 
H{Xi, . . . , X n ). The latter assertion is equivalent to 

partition (X, Xi , . . . , X„ ) 

A A Wxi £ Xi Vxj £ Xj E{xi,Xj ) 

(•,i)ep 

A f\ \fx t £ X t \/xj G Xj ->E(xi,Xj) 

labelii(Jf) if X is an H -labeled node of mdec'(G), that is, 
3X 1 ... 3X n children^ (X, Xi , . . . , X n ). 

suffix(X, Z) if Z is a non-trivial (sequential) suffix of X, that is, 

3Y partition(X, Y, Z) A (Vy G Y Vz G Z E(y, z) A ^E(z, y)). 

sequential(X) if X is a sequential product, that is, it has a non-trivial suffix. 

initial(X, Y) ifY is the first (least) prefix of X, that is, X \ Y is a suffix of X 
and Y itself is not sequential. 

label. (X) if X is a ^-labeled node of mdec'(G), that is, X is sequential and 
either it is prime module, or it is a suffix of a sequential prime module. 

node(X) if X is a node of mdec'(X), that is, X is either a singleton, or a 
•-labeled node, or an (B-labeled node, or an H-labeled node for some prime 
graph H G T with at least three vertices. 

child*(X, Y) if X and Y are nodes and X is an ancestor ofY in mdec'(X), 
that is, Y is properly contained in X. 

child(X, Y) if X and Y are nodes and X is a (the) minimal ancestor ofY. 

dist-child(X , Y) ifY is a distinguished child of X, that is child(X, Y) and, 
either X is ^-labeled and Y is the first prefix of X, or X is H-labeled (for some 
if 6 J with at least three vertices) and 

3Xi ... 3X n children H (X,X 1 ,...,X n ) A \J Y — Xi. 

iedist(H) 

Together with Lemma l4.5l this list of definable properties shows that the formulas 
X = kq(x), X = K\(x), X = K2{x) and X = Ks(x) can be expressed in monadic 
second-order formulas. 

We now verify formally that the mapping G \— > mdec'(G) is A/S-definable. 
As established in Section 14.3.21 it suffices to study the mapping G ^ repr(T) 
where T = mdec'(G). With reference to the definition given at the beginning of 
Section fOl we let k = 3 and n = 4, that is, the definition makes use of 4 parameter 
second-order variables Xq, . . . , X3, which will stand for sets of representatives of the 
=-classes among the domain elements of the form (x, 0), ... , (x, 3). 

Since repr(T) is defined for every G, the role of formula (f(Xo, . . . , X3) is solely 
to make sure that the assignment of values to the parameter variables is correct. It 
is chosen to express, for i = 0, 1, 2, 3, that Xi is contained in the domain of Ki, it 
does not contain distinct elements with the same Ki-image, and for each x, if Ki(x) 
is defined then there exists y £ Xi such that Ki(x) = Ki(y). 

For i £ [0,3], the formula ipi(x,X) is tpi = (x £ Xi). 

Finally, each relation q of arity r in the description of repr(T) is as in repr (T), 
and it can be A/5-defined using the list of properties given above. 
This concludes the proof of Theorem 14. 31 
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Conclusions 

We have proved the equivalence between .7 r -recognizability and CMS'-definability 
for a large class of finite subsignatures T of the modular signature . We have 
not however proved that this equivalence does not hold for the other subsignatures! 
In fact, Courcelle conjectured that CMS'-definability is strictly weaker than MSu n - 
definability for general graphs Conjecture 7.3]. One closely related, yet stronger 
question is to find out whether there exists a finite subset T C such that, 
for sets of J 7 - graphs, CAfS'-definability is strictly weaker than .F-recognizability 
(or than .Foo-recognizability, see Theorem 13.1(1 . Theorem 14.11 docs not solve this 
problem, it only designates a large class of finite signatures for which the two notions 
are equivalent. As pointed out by Courcelle, an archetypal setting to discuss this 
conjecture is given by cographs, that is, the ^"-graphs for T = {©, ©}• 

One can also investigate which natural .7-"- recognizable classes of ^-graphs are 
characterized by algebraic properties of the finite ^"-algebras recognizing them. This 
type of investigation is highly developed in the field of word languages (see |24| ) . 
but also of trace languages ^3E3|H], infinite word languages [23]. Lodaya and Weil 
showed, in this fashion, that the recognizable languages of series-parallel posets of 
bounded width are characterized algebraically (and in an effective fashion) 16 . 
Kuske studied the first-order definable languages of series-parallel posets, and gave 
an algebraic characterization for them in the bounded width case ^31 ■ The general 
(arbitrary-width) case remains open. 

Finally, one could ask for a model of automata to handle ^-"-recognizable sets 
of J 7 - graphs. Let us mention that |17| proposes a model of automata which can 
be used to process JF-graphs if T contains neither © nor ®. To be precise, the 
input for these automata is an mdec-tree or an mdec'-tree, but this distinction 
is not algorithmically crucial if we remember that such a tree can be computed in 
linear time from the graph itself (see |18l 1191 17)1. The accepting power of these 
automata matches exactly that of .F-recognizability. On the other hand, if © E J 7 , 
then the same automaton model can be used but it is strictly more powerful than 
.F-recognizability. Eliminating in this way the use of an associative commutative 
operation reduces the interest of the construction, and the question remains open to 
propose a different automaton model for ^"-graphs in general — or for series-parallel 
posets in particular, that is for the situation where T — {•,©}, studied especially 

in 0113. 
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